Compound Key Word Generation from Document Databases Using A Hierarchical Clustering ART Model
نویسندگان
چکیده
منابع مشابه
Compound Key Word Generation from Document Databases Using A Hierarchical Clustering ART Model
The growing availability of databases on the information highways motivates the development of new processing tools able to deal with a heterogeneous and changing information environment. A highly desirable feature of data processing systems handling this type of information is the ability to automatically extract its own key words. In this paper we address the specific problem of creating sema...
متن کاملDocument Clustering using Compound Words
Document clustering is a kind of text data mining and organization technique that automatically groups related documents into clusters. Traditionally single words occurring in the documents are identified to determine the similarities among documents. In this work, we investigate using compound words as features for document clustering. Our experimental results demonstrate that using compound w...
متن کاملLearning and Representing Topic A Hierarchical Mixture Model for Word Occurrences in Document Databases
ion levels of words document partitioning abstraction levels (a) (b)
متن کاملHierarchical Document Clustering Using Correlation Preserving Indexing
This paper presents a spectral clustering method called as correlation preserving indexing (CPI). This method is performed in the correlation similarity measure space. Correlation preserving indexing explicitly considers the manifold structure embedded in the similarities between the documents. The aim of CPI method is to find an optimal semantic subspace by maximizing the correlation between t...
متن کاملHierarchical Document Clustering using Frequent Itemsets
A major challenge in document clustering is the extremely high dimensionality. For example, the vocabulary for a document set can easily be thousands of words. On the other hand, each document often contains a small fraction of words in the vocabulary. These features require special handlings. Another requirement is hierarchical clustering where clustered documents can be browsed according to t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Intelligent Data Analysis
سال: 1997
ISSN: 1571-4128,1088-467X
DOI: 10.3233/ida-1997-1103